A study on temporal features derived by analytic signal
نویسندگان
چکیده
Traditional feature extraction methods for automatic speech recognition (ASR), such as MFCC (Mel-frequency cepstral coefficients) and PLP (perceptual linear prediction) [6], are extracted from short-term spectral envelopes and can be used to realize promising ASR systems. On the other hand, features extracted by TRAPs-like classifiers [2] are based on long-term envelopes of narrow-band signals. These two forms of feature extractions use a mutual representation of energy in narrow band signals. We have developed a feature extraction system that depends on not only the energy but also the modulation of carrier signals. Carrier signals involve attributes such as the spectral centroid, spectral gradient, number of zero-crossing points, and frequency modulation. Some experiments show that not only the spectral envelope and its modulation but also the zero-crossing points and frequency modulation form a significant portion of human speech perception [4]. In this study, we propose a method of carrier analysis, evaluate this method, and discuss the effectiveness of carrier analysis for ASR. Our method can reduce the phoneme error rate from 45.7% to 38.6%.
منابع مشابه
Proper integration time of polarization signals of internetwork regions using Sunrise/IMaX data
Distribution of magnetic fields in the quiet-Sun internetwork areas has been affected by weak polarization (in particular Stokes Q and U) signals. To improve the signal-to-noise ratio (SNR) of the weak polarization signals, several approaches, including temporal integrations, have been proposed in the literature. In this study, we aim to investigate a proper temporal-integration time with which...
متن کاملPrediction of Epileptic Seizures in Patients with Temporal Lobe Epilepsy (TLE) based on Cepstrum analysis and AR model of EEG signal
Epilepsy is a chronic disorder of brain function caused by abnormal and excessive electrical neurons discharge in the brain. Seizures cause disturbances in consciousness that occur without prior notice, so their prediction ability, based on EEG data, can reduce stress and improve quality of life. An epileptic patient EEG data consists of five parts: Ictal, Inter-Ictal, pre-Ictal, Post-Ictal, an...
متن کاملChoosing the Distinguishing Frequency Feature of People Addicted to Heroin from Healthy while Resting
Addiction is a biological, psychological, and social disease. Several factors are involved in etiology, substance abuse, and addiction which interact with each other and lead to the beginning of drug use and then addiction. Heroin is an addictive drug that, by acting on the central nervous system, reduces the density of neurons in the brain and interferes with decision making. This paper examin...
متن کاملRecognition of Visual Events using Spatio-Temporal Information of the Video Signal
Recognition of visual events as a video analysis task has become popular in machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...
متن کاملSelecting effective features from Phonocardiography by Genetic Algorithm based on Pearson`s Coefficients Correlation
The heart is one of the most important organs in the body, which is responsible for pumping blood into the valvular systems. Beside, heart valve disorders are one of the leading causes of death in the world. These disorders are complications in the heart valves that cause the valves to deform or damage, and as a result, the sounds caused by their opening and closing compared to a healthy heart....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007